A Multi-Scale Learning Framework for Visual Categorization
Authors
Abstract
Spatial pyramid matching has recently become a promising technique for image classification. Despite its success and popularity, no prior work has addressed the problem of learning the optimal spatial pyramid representation for given image data and the associated object category. We propose a Multiple Scale Learning (MSL) framework that learns the best weight for each scale in the pyramid. The MSL algorithm produces class-specific spatial pyramid image representations and thus improves recognition performance. We formulate MSL as a multiple kernel learning (MKL) task, which determines the optimal combination of base kernels constructed at different pyramid levels. We conduct a wide range of experiments on the Oxford flower and Caltech101 datasets, including the use of state-of-the-art feature encoding and pooling strategies. The empirical results reported on both datasets validate the feasibility of the proposed method.
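The idea of weighting one base kernel per pyramid level can be illustrated with a minimal sketch. It is not the authors' implementation. The histogram intersection kernel, the function names, and the toy data are assumptions made for illustration. A simple kernel-target alignment heuristic stands in for a real MKL solver, which the abstract does not specify.

```python
from math import sqrt

def pyramid_histograms(points, words, vocab_size, levels):
    """Return one flattened bag-of-words histogram per pyramid level.

    points: (x, y) pairs with coordinates in [0, 1]; words: visual-word ids.
    Level l splits the image into a 2**l x 2**l grid of cells.
    """
    per_level = []
    for level in range(levels):
        cells = 2 ** level
        hist = [0.0] * (cells * cells * vocab_size)
        for (x, y), w in zip(points, words):
            cx = min(int(x * cells), cells - 1)
            cy = min(int(y * cells), cells - 1)
            hist[(cy * cells + cx) * vocab_size + w] += 1.0
        per_level.append(hist)
    return per_level

def intersection_kernel(h1, h2):
    """Histogram intersection: sum of bin-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))

def base_kernels(images, vocab_size, levels):
    """Build one Gram matrix per pyramid level."""
    feats = [pyramid_histograms(p, w, vocab_size, levels) for p, w in images]
    n = len(feats)
    return [[[intersection_kernel(feats[i][l], feats[j][l]) for j in range(n)]
             for i in range(n)] for l in range(levels)]

def alignment(K, labels):
    """Kernel-target alignment <K, yy^T> / (||K||_F * ||yy^T||_F), labels in {-1, +1}."""
    n = len(labels)
    num = sum(K[i][j] * labels[i] * labels[j] for i in range(n) for j in range(n))
    norm_k = sqrt(sum(K[i][j] ** 2 for i in range(n) for j in range(n)))
    return num / (norm_k * n) if norm_k > 0 else 0.0

def level_weights(kernels, labels):
    """Heuristic stand-in for MKL: weights proportional to clipped alignment."""
    scores = [max(alignment(K, labels), 0.0) for K in kernels]
    total = sum(scores)
    if total == 0:
        return [1.0 / len(kernels)] * len(kernels)
    return [s / total for s in scores]

def combine(kernels, weights):
    """Weighted sum of the base kernels, one weight per pyramid level."""
    n = len(kernels[0])
    return [[sum(w * K[i][j] for w, K in zip(weights, kernels)) for j in range(n)]
            for i in range(n)]

# Toy data (hypothetical): both classes contain the same two visual words,
# but in swapped image quadrants, so only the finer level can separate them.
pos = ([(0.25, 0.25), (0.75, 0.75)], [0, 1])
neg = ([(0.25, 0.25), (0.75, 0.75)], [1, 0])
images = [pos, pos, neg, neg]
labels = [1, 1, -1, -1]

kernels = base_kernels(images, vocab_size=2, levels=2)
weights = level_weights(kernels, labels)
K = combine(kernels, weights)
```

On this toy data, level 0 sees identical histograms for every image, so the heuristic assigns all the weight to level 1. The combined matrix `K` could then be passed to any classifier that accepts a precomputed kernel. Learning a separate weight vector per class would give the class-specific representation described above.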
Similar resources
Cortical Object Segregation and Categorization by Multi-scale Line and Edge Coding
In this paper we present an improved scheme for line and edge detection in cortical area V1, based on responses of simple and complex cells, truly multi-scale with no free parameters. We illustrate the multi-scale representation for visual reconstruction, and show how object segregation can be achieved with coarse-to-fine-scale groupings. A two-level object categorization scenario is tested in w...
Multi-scale lines and edges in V1 and beyond: Brightness, object categorization and recognition, and consciousness
In this paper we present an improved model for line and edge detection in cortical area V1. This model is based on responses of simple and complex cells, and it is multi-scale with no free parameters. We illustrate the use of the multi-scale line/edge representation in different processes: visual reconstruction or brightness perception, automatic scale selection and object segregation. A two-le...
A Framework of Hashing for Multi-instance Multi-label Learning
Multi-instance multi-label learning (Miml) is a powerful framework, which deals with the problem that each example is represented as multiple instances and associated with multiple class labels. Previous works mostly focus on accuracy, while scalability for large scale datasets has been rarely addressed. In this paper, we present a novel framework – Multi-instance Multi-label Hashing (MimlH) to...
A Reinforcement Learning Approach for Attentional Control Based on a Multi-Modal Sensory Feedback
In this work we present a reinforcement learning framework that integrates the processing of information acquired from a multi-modal sensory system (vision and touch). Visual and Haptic features extracted selectively from input buffers are used for object categorization. In this way we can relate sensed information to actions, abstracting and providing a feedback (identification/recognition and...
Comparison of the cognitive activities of patients with post-traumatic stress disorder and neurotic patients
Objectives: This study compared some cognitive activities of two groups of patients: those suffering from post-traumatic stress disorder and those suffering from anxiety and depression. Method: 20 patients in each group were studied through semi-structured interviews, cognitive tests of learning, visual and verbal pairs associations, digit span, word fluency, learning digit, and Verbal...